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MONADIC SECOND ORDER FINITE SATISFIABILITY AND 
UNBOUNDED TREE-WIDTH 

TOMER KOTEK, HELMUT VEITH, FLORIAN ZULEGER 


Abstract. The finite satisfiability problem of monadic second order logic is 
decidable only on classes of structures of bounded tree-width by the classic 
result of Seese m- We prove that the following problem is decidable: 

Input: (i) A monadic second order logic sentence a, and (ii) a sen¬ 
tence /3 in the two-variable fragment of first order logic extended 
with counting quantifiers. The vocabularies of a and /3 may inter¬ 
sect. 

Output: Is there a finite structure which satisfies a A /3 such 
that the restriction of the structure to the vocabulary of a has 
bounded tree-width? (The tree-width of the desired structure is 
not bounded.) 

As a consequence, we prove the decidability of the satisfiability problem by a 
finite structure of bounded tree-width of a logic extending monadic 

second order logic with linear cardinality constraints of the form \Xi \ -!-•■■ + 
|Ar| < 111 I -h ■ • • + I on the variables Aj, Yj of the outer-most quantifier 
block. We prove the decidability of a similar extension of WSIS. 


1. Introduction 

Monadic second order logic (MSO) is among the most expressive logics with 
good algorithmic properties. It has found countless applications in computer sci¬ 
ence in diverse areas ranging from verification and automata theory [UllsillS] to 
combinatorics nsim, and parameterized complexity theory lii. 

The power of MSO is most visible over graphs of bounded tree-width, and with 
second order quantifiers ranging over sets of edge^ (1) Courcelle’s famous theorem 
shows that MSO model checking is decidable over graphs of bounded tree-width 
in linear time OH]. (2) Finite satisfiability by graphs of bounded tree-width is 
decidable [S] (with non-element ary complexity) - thus contrasting Trakhtenbrot’s 
undecidability result of first order logic. (3) Seese proved [23] that for each class Jf 
of graphs with unbounded tree-width, finite satisfiability of MSO by graphs in is 
undecidable. Together, (2) and (3) give a fairly clear picture of the decidability of 
finite satisfiability of MSO. It appeared that (3) gives a natural limit for decidability 
of MSO on graph classes. For instance, finite satisfiability on planar graphs is 
undecidable because their tree-width is unbounded. 

While Courcelle and Seese circumvent Trakhtenbrot’s undecidability result by 
restricting the classes of graphs (or relational structures), several other research 
communities have studied syntactic restrictions of first order logic. Modal logic [26] . 
many temporal logics m, m Chapter 24], the guarded fragment [S], many de¬ 
scription logics [2], and the two-variable fragment m are restricted first order 

^The logic we denote by MSO is denoted MS 2 by Courcelle and Engelfriet [6]. 
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logics with decidable finite satisfiability, and hundreds of papers on these topics 
have explored the border between decidability and undecidability. While many of 
the earlier papers exploited variations of the tree model property to show decid¬ 
ability, recent research has also focused on logics such as the two-variable fragment 
with counting [H] [22], where finite satisfiability is decidable despite the ab¬ 
sence of the tree model property. In a recent breakthrough result, Charatonik and 
Witkowski |3] have extended this result to structures with built-in binary trees. 
Note that this logic is not a fragment of first order logic, but more naturally un¬ 
derstood as a very weak second order logic which can express one specific second 
order property - the property of being a tree. 

Our main result is a powerful generalization of the seminal result on decidabil¬ 
ity of the satisfiability problem of MSO over bounded tree-width and the recent 
theorem by |3]: We show decidability of finite satisfiability of conjunctions a A /? 
where a is in MSO and P is in by a finite structure 971 whose restriction to the 
vocabulary of a has bounded tree-width. (Theorem]^ in Section]^ 

Let us put this result into perspective: 

• The MSO decidability problem is a trivial consequence by setting /3 to true; 
Charatonik and Witkowski’s result follows by choosing a to be an MSO 
formula which axiomatizes a d-ary tree, which is a standard construction [Bj. 

• The decidability of model checking a A f3 over a finite structure is a much 
simpler problem than ours: We just have to model check a and /3 one after 
the other. In contrast, satisfiability is not obvious because a and /3 can 
share relational variables, running two finite satisfiability algorithms for 
the two formulas independently may tield two models which disagree on 
the shared vocabulary. Thus, the problem we consider is similar in spirit 
to (but technically very different from) Nelson-Oppen [20] combinations of 
theories. 

• Our result trivially generalizes to Boolean combinations of sentences in the 
two logics. 

Proof Technique. We show how to reduce our satisfiability problem for a A /? 
to the finite satisfiability of a C^-sentence with a built-in tree, which is decidable 
byi. The most significant technical challenge is to eliminate shared binary relation 
symbols between a' and /?. Our Separation Theorem overcomes this challenge by 
an elegant construction based on local types of universe elements and a coloring 
argument for directed graphs. The second technical challenge is to replace the 
MSO-sentence a with an equi-satisfiable C^-sentence a'. To do so, we apply tools 
including the Feferman-Vaught theorem for MSO and translation schemes. 

Monadic Second Order Logic with Cardinalities. Our main theorem imply 
new decidability results for monadic second order logic with cardinality constraints, 
i.e., expressions of the form |Afi| -|- .. .\Xr\ < IWI -f where the Xi and 

Yi are monadic second order variables. Klaedtke and Ruefi m showed that the 
decision problem for the theory of weak monadic second order logic with cardinality 
constraints of one successor (WSIS'^^'^'^) is undecidable; they describe a decidable 
fragment where the second order quantifiers have no alternation and appear after 
the first order quantifiers in the prefix. Our main theorem implies decidability of 
a different fragment of WSIS with cardinalities: The fragment consists 

of formulas 3Xip where the cardinality constraints in if involve only the monadic 
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second order variables from X, cf. Theorem in Section Note that in contrast 
to [15], our fragment is a strict superset of WSIS. 

For WS2S, we are not aware of results about decidable fragments with cardinal¬ 
ities. We describe a strict superset of MSO whose satisfiability problem over finite 
graphs of bounded tree-width is decidable, and which is syntactically similar to the 
WSIS extension above. 

Expressive Power over Structures. Our main result extends the existing body 
of results on finite satisfiability by structures of bounded tree-width to a signifi¬ 
cantly richer set of structures. The structures we consider are C^-axiomatizable 
extensions of structures of bounded tree-width. For instance, we can have intercon¬ 
nected doubly-linked lists as in Fig. [^a), or a tree whose leaves are connected in a 
chain and have edges pointing to any of the nodes of a cyclic list as in Fig. 0b)- 
Such structures occur very naturally as shapes of dynamic data structures in pro¬ 
gramming - where cycles and trees are containers for data, and additional edges 
express relational information between the data. The analysis of semantic relations 
between data structures served as a motivation for us to investigate the logics in 
the current paper |3]. 

Being a cyclic list or a tree whose leaves are chained can be expressed in MSO 
and both of these data structures have tree-width at most 3. We can compel the 
edges between the tree and the cyclic list to obey C^-expressible constraints such 
as: 

• every leaf of the tree has a single edge to the cyclic list; 

• every node of the cyclic list has an incoming edge from at least one leaf of 
the tree; or 

• any two leaves pointing to the same node of the cyclic list agree on mem¬ 
bership in some unary relation. 

Note that while the structures we consider may contain grids of unbounded sizes 
as subgraphs, the logic cannot axiomatize them. 

2. Background 

This section introduces basic definitions and results in model theory and graph 
theory. We follow dH and IS]. 

The two-variable fragment with counting is the extension of the two- 
variable fragment of first order logic with first order counting quantifiers 3-", 3-”, 
3^", for every n G N. Note that remains a fragment of first order logic. 
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Figure 2. 


Monadic Second order logic MSO is the extension of first order logic with 
set quantifiers which can range over elements of the universe or subsets of rela¬ 
tion^ Throughout the paper all structures consist of unary and binary relations 
only. Structures are finite unless explicitly stated otherwise (in the discussion of 
WSIS). Let C be a vocabulary (signature). The arity of a relation symbol C € C 
is denoted by arity(C). The set of unary (binary) relation symbols in C are un(C) 
(bin(C)). We write MSO(C) for the set of MSO-formulas on the vocabulary C. The 
quantifier rank of a formula tp S MSO, i.e. the maximal depth of nested quantifiers 
in (f is denoted qr((^). We denote by 2liU2l2 the disjoint union of two C-structures 
2li and 2 I 2 . Given vocabularies Ci Q C 2 , a C 2 -structure 212 is an expansion of a 
Ci-structure 2ti if 2li and 212 agree on the symbols in Ci; in this case 2li is the 
reduct of 2 I 2 to Ci, i.e. 2li is the Ci-reduct of 212- We denote the reduct of 2 I 2 to 
Cl by 212 |ci ■ A C-structure 21o with universe Aq is a substructure of a C-structure 
2ti with universe Ai if Aq C Ai and for every R G C, 0 \Ye 

say that 2lo is the substructure of 2li generated by Aq. 

Graphs are structures of the vocabularjj^ Cg = (s) consisting of a single bi¬ 
nary relation symbol s. Graphs are simple and undirected unless explicitly stated 
otherwise. Tree-width tw(G) is a graph parameter indicating how close a simple 
undirected graph G is to being a tree, cf. [B]. It is well-known that a graph has 
tree-width at most k iff it is a partial fc-tree. A partial fc-tree is a subgraph of a 
fc-tree. fc-trees are built inductively from the {k + l)-clique by repeated addition 
of vertices, each of which is connected with k edges to a /c-clique. The Gaifman 
graph Gaif(2l) of a C-structure 21 is the graph whose vertex set is the universe 
of 21 and whose edge set is the union of the symmetric closures of for every 
C G bin(C). Note the unary relations of 21 play no role in Gaif(2l). The tree- 
width tw(2l) of a C-structure 21 is the tree-width of its Gaifman graph. In this 
paper, tree-width is a parameter of finite structures only. Fix k G N for the rest of 
the paper, k will denote the tree-width bound we consider. 

We introduce the notion of oriented fc-trees which refines the notion of fc-trees. 
Let TZ = {i?i,..., i?fc} be a vocabulary consisting of binary relation symbols. An 
oriented fc-tree is an 7?.-structure 21 in which all i?f* are total functions and whose 
Gaifman graph Gaif(9I) is a partial fc-tree. 


^On relational structures, MSO is also known as Guarded Second Order logic GSO. The 
results of this paper extend to CMSO, the extension of MSO with modular counting quantifiers. 

^Since we explicitly allowed quantification over subsets of relations for MSO, we do not view 
graphs and structures as incidence structures, in contrast to [6] Sections 1.8.1 and 1.9.1]. 
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Lemma 1. Every C-structure dJt of tree-width k can be expanded into a (C U TZ)- 
structure such that: (i) is an oriented k-tree, (ii) Gaif(9T) is a subgraph of 
Gaif(*Jl|K), and (Hi) the tree-width is k. 

The oriented 2-tree in Fig.j^b) is an expansion of the directed graph in Fig.j^a) 
as guaranteed in Lemma In Fig. [^b), Ri and i ?2 are denoted by the dashed 
arrows and the dotted arrows, respectively. There are several other oriented fc-trees 
which expand Fig.[^a) and fulfill the requirements in Lemma 

To see that Lemmaholds, we describe a construction of 91 echoing the process 
of constructing fc-trees above. For each vertex u of the initial {k-\- l)-clique, we can 
set the values of ..., Rf)}{u) to be the other k vertices of the clique. When 

a new vertex u is added to the fc-tree, k edges incident to it are added. We set 
..., R'^iu) to be the set of vertices incident to u. For oriented fc-trees whose 
Gaifman graph is not a fc-tree the construction of an oriented fc-tree is augmented 
by changing the value of R'piu) to R^{u) = u whenever Rf'{u) is not well-defined. 
This can happen when the target of u under is a vertex which was eliminated 
by taking the subgraph of a /c-tree to obtain the partial A:-tree. 

3. Overview of the Main Theorem and its Proof 

The precise statement of the main theorem is as follows: 

Theorem 1 (Main Theorem). LetCbnd andCunb be vocabularies. Lets be a binary 
relation symbol not in Cbnd ^Cunb- Let a G MSO(C/,nd) o^nd /? S C^{Cunb)- There is 
an effectively computable sentence S G C^{'D) over a vocabulary R ^ {s} such that 
the following are equivalent: 

(i) There is a {Cbnd^Cunb)-structure 991 such that 991 |= aA/3 and tw(99l|ci,„,i) < k. 

(ii) There is a V-structure 91 such that 91 |= ^ and is a binary tree. 

The first step towards proving Theorem is the Separation Theorem: 

Theorem 2 (Separation Theorem). Let Cbnd iind Cunb be vocabularies. Let a G 
MSO(Cf,„(i) o,nd (3 G C^{Cunb)- There are effectively computable sentences a' G 
M.SO{Vbnd) andp' G C^Vunb) over vocabularies Vbnd and Vunb such that Vbnd H 
'Tunb only contains unary relation symbols and the following are equivalent: 

(i) There is a {Cbnd U Cunb)-structure 991 with dJl \= a A /d and tw(9Jl|ct„j) < k 

(ii) There is a {Vbnd U Vunb)-structure 91 with ‘n\= a' A j3' and tw(91|x)6„d) < k. 

In conjunction with Theorem we only need to prove Theorem [l] in the case 
that the MSO-formula a and the C^-formula (3 only share unary relation symbols. 
The significance of Theorem is that it allows us to use tools designed for MSO 
in our more involved setting. The proof of Theorem uses notions of types for 
C^-sentences in Scott normal form, oriented fc-trees, coloring arguments, and an 
induction on ranks of structures. Theoremj^is discussed in Section]^ The next step 
is to move from structures whose reducts have bounded tree-width to structures 
which contain a binary tree. 

Lemma 2. Let Cbnd and Cunb be vocabularies such that Cbnd CiCunb contains only 
unary relation symbols. Let s be a binary relation symbol. There is a vocabulary 
Vbnd consisting of s and unary relation symbols only, and, for every a G MSO{Cbnd) 
and (3 G C‘^{Cunb), effectively computable sentences a' G MSO(I9{,„£i) and (3' G 
C^{Vbnd Cl Cunb) such that the following are equivalent: 
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(i) There is a (Cbnd U Cunb)-structure 971 such that 971 ^ a A /3 and such that 
tw(97t|cj„J < k. 

(ii) There is a {'Dbnd^Cunb)-structure 9^ such that 9t |= a'A/3' and is a binary 
tree. 

Technically, Lemmaj^is proved using a translation scheme which maps structures 
with a binary tree into structures whose C&nd-reducts have tree-width at most k, and 
conversely, each of the latter structures is the image of a structure with a binary 
tree under the translation scheme. Translation schemes capturing the graphs of 
tree-width at most k as the image of labeled trees were studied in the context of 
decidability and model checking of MSO [1]. We need a more refined construction 
to ensure that the translation scheme also behaves correctly on C^-sentences, i.e. 
that it maps C^-sentences to C^-sentences, see Lemma [ 2 ] in Section 

Now that we have reduced our attention to the case that our structures contain 
a binary tree, we can replace MSO-sentences with equi-satisfiable C^-sentences. 

Lemma 3. Let C be a vocabulary which consists only of a binary relation symbol s 
and unary relation symbols. Let a be an M.SO(C)-sentence. There is an effectively 
computable C‘^{'D)-sentence 7 over a vocabulary V fL C such that for every C- 
structure 971 in which is a binary tree the following are equivalent: 

(i) dJfl \= a (ii) There is a V-structure 9T expanding 971 such that 93 ^ 7 . 

For the proof of Lemma we use a Feferman-Vaught type theorem which states 
that the Hintikka type (i.e. MSO types) of a binary tree labeled with unary relation 
symbols depends only on the Hintikka types of its children. We can therefore 
axiomatize in that the Hintikka type of the labeled binary tree implies a given 
MSO-sentence. 

Having replaced the MSO-sentence in statement (ii) of Lemma with a C^- 
sentence, we are left with the problem of deciding whether a C^-sentence is satisfi- 
able by a structure in which a specified relation is a binary tree, which has recently 
been shown to be decidable: 

Theorem 3 (Charatonik and Witkowski |3]). LetC be a vocabulary which contains 
a binary relation symbol s. Given a C'^{C)-sentence (p, it is decidable whether (p is 
satisfiable by a structure 971 in which is a binary tree. 

4. Separation Theorem 

4.1. Basic Definitions and Results. 

1-types and 2-types. We begin with some notation and definitions in the spirit 
of the literature on decidability of C^, cf. e.g. nan. Let M be a vocabulary of 
unary and binary relations. 

A 1-type TT is a maximal consistent set of atomic M-formulas or negations of 
atomic A-formulas with free variable x, i.e., exactly one of A{x) and -^A(x) belongs 
to TT for every unary relation symbol A € A, and exactly one of B{x, x) and ^B{x, x) 
belongs to tt for every binary relation symbol B G A. We denote by c\)ax^{x) = 
Algtt '' formula that characterizes the 1-type tt. We denote by l-Types(M) the 
set of 1 -types over A. 

A 2-type A is a maximal consistent set of atomic A-formulas or negations of 
atomic A-formulas with free variables x and y, i.e., for every z G {x,y} and unary 
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relation symbol A, exactly one of A{z) and -^A{z) belongs to A, and for every 
Zi,Z 2 € {x,y} and binary relation symbol B £ A, exactly one of B{zi,Z 2 ) and 
^B{zi,Z 2 ) belongs to A. We note that the equality relation « is also part of a 2- 
type. We write A“^ for the 2-type obtained from A by substituting all occurrences of 
X resp. y with y resp. x. We write A^; for the 1-type obtained from A by restricting 
A to formulas with free variable x. We write Xy for the 1-type obtained from A by 
restricting A to formulas with free variable y and substituting y with x. We denote 
by c^ar;^(a:, y) = Ate a '' formula that characterizes the 2-type A. We denote by 
2-Types(Al) the set of 2-types over A. 

Let 911 be a Al-structure. We denote by l-tp®'(u) the unique 1-type tt such 
that 911 1= c()ar^(u). For elements u,v of 911, we denote by 2-ip™{u,v) the unique 
2-type A such that 911 ^ chor;,(u,u). We denote by 2-tp(911) = {2-tp^{u,v) \ 
u, V elements of 911} the set of 2-types realized by 911. The following lemma is easy 
to see: 


Lemma 4. Let 911i,91l2 be two A-structures over the same universe M and let 
(j) = \/x, y. X G C‘^{A) with X quantifier-free. If .g-tp(911i) = ^-ip(91l2), then 911i \= 

*#»t2 h A 

(See proof in Appendix 8.4 ) 


Scott Normal Form and T-functionality. C^-sentences have a Scott-Normal 
Form, cf. |12| . which can be obtained by iteratively applying Skolemization and 
introducing new predicates for subformulas, together with predicates ensuring the 
soundness of this transformation: 


Lemma 5 (Scott Normal Form, [H]). For every C'^-sentence fi there is a C^- 
sentence fi' of the form 

(1) Vx, y.xX f\ Vx. 3=^y. S^{x, y), 

iG[i] 

with X quantifier-free, over an expanded vocabulary such that fi and fi' are equi- 
satisfiable. Moreover, fi' is computable. The expanded vocabulary contains in par¬ 
ticular the fresh binary relation symbols in S = {Si,..., Si} 

Let T be a set of binary relation symbols. We say a structure 911 is T-functional, 
if for every T G T, T™ is a total function on the universe of 911. Observe the 
following are equivalent for every structure 911: 

(i) 911 satisfies Eq. Q, and (ii) 911 A Vcc, y. x and 911 is 5-functional. 

Message Types and Chromaticity. Let T C bin(Al) be a subset of the binary 
relation symbols of A. We write A G T-MsgTypes(Al) and say A is a T-message 
type, if A G 2-Types(Al) and T{x,y) G A for some T gT. Let 911 be a Al-structure 
with universe M. We define = {{u,v) G \ there is a T G T with 911 A 
T{u,v)}. The T-message-graph is the directed graph G™ = {M,E), where 
E = {{u,v) G AP \ u V and {u,v) G o E^}, where Ro S = {{a,b) \ 
there is a c with (a, c) G R and (c, b) G 5} denotes the usual composition of rela¬ 
tions. We say 911 is T-chromatic, if l-tp®^(u) A l-tp®^('y) for all {u,v) G E. 

We note that if 911 is T-functional, then has out-degree deg^{u) < |Tp for 
all u G M. This allows us to prove Lemma [phased on Lemma the proofs can be 
found in Appendices |8.6| and |8.5[ 
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Lemma 6. There is a finite set of unary relations symbols colorsT such that every 
T-functional A-structure can be expanded to a T-chromatic {A'Oco\oT:S'j-)-structure. 

Lemma 7. Let G = (V, E) be a directed graph with out-degree deg^ {v) < k for all 
V G V. Then, the underlying undirected graph has a proper 2k + 1-coloring. 

4.2. Separation Theorem. Let G = (V,E) be an (undirected) graph. We say G 
is fc-bounded, if the edges of G can be oriented such that every node of G has 
out-degree less than k. We say a structure 9Jl is A:-bounded if its Gaifman graph is 
fc-bounded. 

Theorem (Separation Theorem). Let k be a natural number. LetChnd and Cunb 
be vocabularies. Let a G MSO{Ci,nd) antQ ^ G C^{Cunb)- There are effectively 
computable sentences a' G MSO(I?f,„(i) and j3' G (!?„„{,) over vocabularies Vtud 
and Dunb such that D^nd H only contains unary relation symbols such that for 
every k-bounded graph G the following are equivalent: 

(i) There is a {Cbnd U Cunb)-structure 9^ with ^ a A/3 and Gaif(9Jl|ct„d) = G. 

(ii) There is a {Vbnd^'Dunb)-structure 04 with 04 )= a' Afi' and Gaif(04|x)t„j) = G. 

We assume that /3 is in the form given in Eq. 0 for some set of binary relation 
symbols S = {S'!,..., S';} C Cunb and quantifier-free C^-formula x- Let TZ = 
{Ri ,..., Rk} be a set of fresh binary relation symbols. We set T = S LlTZ. We 
begin by giving an intuition for the proof of the Separation Theorem in three stages. 

1. Syntactic separation coupled with semantic constraints. For a binary 
relation symbol B, we define its copy as the relation symbol B. For every vocabulary 
A, we define its copy A = un(y4) U {i3 | B G .4} to be the unary relation symbols of 
A plus the copies of its binary relations symbols. We assume that copied relation 
symbols are distinct from non-copied symbols, i.e., bin(.4) H bin(.4) = 0. For a 
formula Lp over vocabulary A, we define its copy ip over vocabulary A as the formula 
obtained from ip by substituting every occurrence of a binary relation symbol B G A 
with B. 

The sentences a (the copy of a) and /3 do not share any binary relation symbols. 
Glearly, (i) from Theorem holds iff 

(I) dA/3 is satisfied by a ((C^nd UCun,;,) U (Cb„d UC„„b))-structure 04 with B'^ = B 
for all B G bin(C(,„d) and Gaif(9J4|ci,„J = G. 

In the next two stages we will construct ol and fi' so that (I) is equivalent to (ii) 
from Theorem]^ More precisely, we will construct sentences pbnd, hunb G G^ifDunb) 
with Vunb 3 Cbnd u Cunb and Vbnd = Runb such that (I) is equivalent (II): 

(II) (a A JTbEd) A (fi A pLunb) is satisfied by a {Vbnd U T>„„f,)-structure 04 with 
Gaif(04b,„J = G. 

2. Representation of fc-bounded structures using functions and unary 
relations. Theoremj^as well as (I) and (II) involve reducts which are fc-bounded 
structures, fc-bounded .4-structures 21 can be represented by introducing new binary 
relation symbols interpreted as functions and new unary relation symbols as follows, 
(a) We add k fresh relation symbols TZ = {Ri,..., Rk) and axiomatize that these 

relations are interpreted as total functions. 

^The Separation Theorem remains correct if we replace with any logic containing which 
is closed under conjunction. 
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(b) We add fresh unary relations {Pa | A G P-MsgTypes(y^)} and axiomatize that 
every element labeled by P\ has an outgoing edge with 2-type A. The symbols 
Pa are called unary 2-type annotations. 

(c) We axiomatize that Gaif(2l|bin(A)) = Gaif( 2 l|K). 

In other words, the functions interpreting witness that 21 can be 

oriented so that every node in the Gaifman graph of 21 has outdegree at most k. 
The 2-type of each edge {u, v) in 21 is encode by putting the unary relation symbol 
Pa of the 2 -type of (u, v) on the source u in the orientation. 

Given a {{Cbnd UC„„b) U {Cbnd UC„„;,))-structure 2d, we will use the above repre¬ 
sentation twice, on and by axiomatizing that every element labeled by 

Pa has an outgoing edge with 2-type A and an outgoing edges with 2-type A. This 
will allow us to replace the condition from (I) that = B for all B G hin(Cb„d) 
with the condition that Rf^ = Ri for all Ri G TZ. We will define a vocabulary 
^ 2 Cbnd U Cunb U TZ. Let T’-jz = {P\ I A G P-MsgTypes(f)}. We will define two 
vocabularies P^unh ^ Cbnd ZiCnnb UPUPt^ and TTbnd — TZ^unb- According to (a), (b), 
and (c), we will construct VbndiVunb G C'^i'Dunb) such that (I) equivalent to the 
following: 

(P) (aA Vbnd ) A {PAi/unb ) is Satisfied by a PfondUPti„(,-structure 9Jl with Pf' = 
for all Ri gTZ and Gaif( 9 Il| 7 ?,) = G. 

3. Establishing the semantic condition of (I’) by swapping edges. Here 
we discuss how to show the implication from (II) to (P) and make the vocabularies 
Vbnd and Vunb precise. Let 21 be a {Vbnd U Pimj,{-structure with 21 |= {a A yibnd) A 
(/? A pLunb)- It simplifies the discussion to split a {Vbnd U {-structure 21 into 
two PimS-structures. The P„„6-structure £ is 2l|x)„„i,. The P„„6-structure £' is 
obtained from 2I|x)i,„j by renaming copies of relation symbols B to B — i.e., we 
define the Ptm^-structure £' by setting l-tp'*' {u) = l-tp^(u{ for all u G M and 
setting £' ^ B{u,v) iff 21 ^ B{u,v) for all u,v G M and B G hiii{Vbnd)- The 
interpretations of the relations Ri might differ in £ and £'. Observe that we have 
Z' \= a and Z \= (5. The key idea of the proof is prove the existence of a sequence 
of structures £ = £o,..., £p, where each 21^+1 is obtained from 21^ by swapping 
edges, until the interpretations of the relations Ri agree in Zp and Z'. The edge 
swapping operation is a local operation which involves changing the 2-types of at 
most 4 edges. 

The edge swapping operation satisfies two crucial preservation requirements: 
edge swapping preserves (PR-1{ the truth value of P, i.e. Zp ^ /3, and (PR-2{ 
P-functionality. The universal constraint Va;, i/. y in /3 is maintained under edge 
swapping because this operation does not change the set of 2-types (see Lemma |^. 
To satisfy the preservation requirements (PR-1{ and (PR-2{, all that remains is to 
guarantee the existence of a sequence of edge swapping preserving T-functionality. 
Note that T-functionality amounts to T-functionality and 5-functionality. We use 
two main techqniues for ensuring the preservation of T-functionality: chromaticity 
and unary 2-type annotations. We will axiomatize that the structures £ and £' 
are chromatic and we will take care that chromaticity is maintained during edge 
swaps. We will add fresh unary relation symbols P’s = {-Pa | A G 5-MsgTypes(£’{} 
and axiomatize that every element of 21 labeled by P\ has an outgoing edge with 
2-type A and an outgoing edge with 2-type A. 
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Proof of the Separation Theorem. We now start the formal proof of the Sep¬ 
aration Theorem. Let colorsT- be the vocabulary from Lemma We set £ = 

C6„dUC„„(,U7?.Ucolorsr- We set Vb^d = SUVs^V-r, and Vunb = Next we will 

define formulas i/; € MSO(I?6„d) and /?' S C'^i'Dbnd), and set a' = ip € MSO(I?ti„(,). 
We set Rbnd — ^^^^Cbnd^ b^unb — ^^Cunbt ^bnd — b'bnd^^: and fiunb — b^unb^^i 
Ip = a A ^dbnd, and pi' = pi A fiunb, where: 

^ = A*G[fc]^2;.3=iy. 

e = \/x,y.x!^y ^ (y^^^^^Riix,y)W Riiy,x) B{y,x)'^ 

e = ABGbin(c„„6) ^x.y.x^y-^ {b{x, y) -A V,g[fc] R^{x, y) V R,{y, x)) 

Cbnd — A yx.Px{x) AA 3y.ct}aX)^{x,y) 

A G 7^-MsgTypes(£') or 

(A G T-MsgTypes(£') and A~^ G 7^-MsgTypes(f)) 

Cunb — A yx.Px{x) AA 3y.cl^axx{x,y) 

A G T-MsgTypes(£^) 

V = A '^x,y.Px{x) AT{y,x) A^T'{x,y) ^ ^cl)axx^iy) 

A G T-MsgTypes(£^) 

T,T' ^T: T'{x,y) G A 

5 expresses that each Ri is interpreted as a total function, 6 expresses that 
for every I?„„{,-structure £ with £ ^ 0 we have that Gaif(£| 7 ?,) = Gaif(£|ci,„,i), e 
expresses that for every I?„„{,-structure £ with £ |= e we have that Gaif(£|-p„„i,) 
is a subgraph of Gaif(£| 7 j), Q expresses that for every A G T-MsgTypes(£) with 
A G 7?.-MsgTypes(£) or A G 7^-MsgTypes(£’), I?„„b-structure £ and element u of 
£ we have £ \= P\{u) iff there is an element u of £ such that 2-ip^l'^ {u, v) = X, ( 
expresses that for every A G T-MsgTypes(£’), I?„„b-structure £ and element u of 
£ we have £ ^ Px{u) iff there is an element u of £ such that 2-ip^^^ {u,v) = A, 77 
expresses that for every I?ti„;,-structure £ with £ ^ Cunb, XL is chromatic iff £ ^ 77 . 

The direction “(i) implies (ii)” of the Separation Theorem is not hard: 

Lemma 8. Let G be a k-bounded graph. Let DJI be a (Cbnd U Cunb)-structure with 

1= a Apt and Gaif(9Jt|ci,„,;) = G. Wl can be expanded to a {Dbud Cl Vunb)-structure 

with ^\= a' A pi' and Gaif(fTl|-D6„d) = G. 

Proof. Because of Gaif(®t|cj„,j) = G and G is fc-bounded, we can expand 3Jl to 
a Cbnd C Cunb U 7?.-structure 9Jl such that Gaif(£| 7 ^) = Gaif(£|ct„,i) and Rf^ is a 
total function for all i G [k] (possiW adding self-loops for the relations Ri). Thus, 
'iLR\= 5 A9. According to Lemma IH can be expanded to a chromatic structure 
Wt over vocabulary £ with 3K |= 77 . We expand IH to a I?(,„d-structure £ such that 
for all M G M and A G T-MsgTypes(£’) we have £ |= P\{u) iff there is an element 
u of £ such that {u,v) = A. This definition gives us £ ^ Cunb, and thus 

£ 1= /?'. We expand £ to a {Vbnd U )-structure Dd such that for all u,v G M 
and B G B we have Tt |= B{u, u) iff 9d |= B{u, v) and fff ^ Ri{u, w) or Dd ^ Ri{v, u) 
for some i G [fc]. We note that f)d |= a'. □ 

Now we turn to the direction “(ii) implies (i)”. Let G be a A:-bounded graph. Let 
9d be a {Vbnd U )-structure with ^\= a' A pi' and Gaif(9d|x)t„,i) = G. Let M be 
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the universe of iTl. We define the I?ti„(,-structure £' by setting 1-ip^ (u) = l-tp^(u) 
for all w € M and setting £,' |= B{u,v) iff fll ^ B{u,v) for all u,v G M and 
B G bin(I?(,nd)- We note that \= ^ and Gaif(£') = G. We define the Vunb- 
structure £ by setting £ = We note that £ ^ /?'. 

We make the following definition: For u G M and i G \k] we set rankl^(2., £') = 
1, if there are v,w G M with £ |= Ri{u,v), £' ^ Ri{u,w) and v ^ w; we 
set rankli^{2, 2') = 0, otherwise. We set ranfc„(£,£') = 

rank{2, 2') = ranku{2,2'). rank measures the deviation of the relations 

7^ in £ and £' (we note that there always are unique v,w G M ior u G M with 
£ ^ Ri{u,v), £' 1= Ri{u,w) because of £ ^ 5 and £' ^ 6) and has the following 
important property: 

Lemma 9. If rank{2, £') = 0, then £|cwUC„„t \= a ^ fi and Gaif(£cj„J = G. 

(See Appendix \8.S\ for the proof.) 

The proof of Lemma uses the following simple but useful property of the rank 
function: 

Lemma 10. Let u G M be an element with rank'(^[2, £') = 0 and let A G 2-Types{£) 
with Rj G A. For all v G M we have 2-ip^^^ {u,v) = X iff 2-ip^ ^^{u,v) = A. 

(See Appendix \8.'/\ for the proof.) 

Lemma 11. There is a sequenee of Bunb-structures 2o,..., 2p, with universe M 
and £ = £o, such that: 

(1) l-tp~’(u) = l-tp~ (u) for all u G M, 

(2) 2-{)?{2i) = .2-tp(£i+i) for allO <i <p, 

(3) 2i 1= '0, (in particular 2i is B-functional and chromatic), 

(4) rank{2i,2') > rank{2i+i,2') for all 0 < i < p, and rank(2p, 2') = 0 . 

Proof. Assume we have already defined 2i and rank{2i, 2') >0. In the following 
we will define £^ 4 - 1 . Because of rank{2i,2') > 0 we can choose some elements 
u,v,w G M and j G [A:] such that 2i \= Rj{u,v), £' \= Rj{u,w) and v B ru. Let 
A = 2-ip^’l^ (u, u). We have A G T-MsgTypes(£) because of Rj G TZ. We have 
2i ^ Px{u) because of 2i ^ Cunb- Because l-tp(u)'^’ = l-tp(M)^ we have £' \= 
P\{u). Thus, 2i 1= and £' ^ c()ac;^^(w). With l-tp^*(w) = l-tp'*' (w) 

we get 2i ^ and thus l-tp'*''!^(?;) = 1-ip^’l^ (ic). We proceed by a case 

distinction: 

Case 1: A“^ is a T-message type. 

We have £' \= Px-i{w) because of 2' ^ Cbnd- We get 2i ^ Px-i{w) because of 
l-tp^’(w;) = 1-tp'*' {w). With 2i |= (unb, there is an element a G M such that 
A = 2-tp^‘l'^(a,u>) and 2i ^ Px{a). We note that u a because of 2i |= Rj{a,w), 
V B w and v is the unique element with 2i ^ Rj(u,v) (using 2i |= 5). Moreover, 
2i ^ c(]ar_,^-i(a) and £' |= cf)ac^-i(M). With l-tp'^’(u) = 1-tp^ (u) we get 2i ^ 
cf)ar;^-i( m), and thus l-ip^’l^(M) = l-ip^’l^(a). 

We note that the edges {u,w), {w,u), {a,v) and (u, a) do not have T-message 
types (*) because 2i is chromatic, u ^ a, v B ru, l-tp'*'*!'^(z;) = l-tp^*!*^(w), 
l_tp^»h(u) = l-tp^d£(Q,) and the edges {u,v), (v,u), (a,w) and (w,a) have T- 
message types. Similarly, we get A T 2-tp'*' I® («,■!;) and A ^ 2-tp^ l^(a, w) (**) 
because 2' is chromatic, A = 2-tp'‘' ^^{v,w) and A and A“^ are T-message types. 
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We define £i+i as follows: The unary relations of are defined such that 
we have = l-tp'^‘(w) for all elements u G M. We obtain the binary 

relations of £i+i by swapping the edges (it, w) and (it, v) as well as (a, w) and 
( 0 , 11 ) in £,i: we set (it, ic) = A, {a,v) = A, (a, ic) = 

2_tp^»l£(a,i;) and 2-tp^’+^l^(it,i;) = 2-tp~‘l'^(it,ic); these 2-types are well-defined 
because of l-tp^’l^(T) = l-tp'*'’!'^(ic). All other 2-types in 2,i+i\e are the same as in 
Zi\£. This completes the definition of £i+i. 

We now argue that £^+1 satisfies properties Q-Q. Clearly, £^+1 satisfies Q by 
definition. Because we only swapped edges from Zi to £i+i we have (HI). From ([^, 
Lemma 1^ and Zi |= x, we get £i+i \= x- Because we only swapped swapped edges 
from Zi to £i+i we get Zi+x ^ 9 and £i+i |= e from Zi\= 9 and Zi ^ e. Because of 
(*) and 2-tp-^*l^(ii,ii) = A = 2-tp-^'l'^ (a, ic) we get Zi+i (= (unb from Z^ (= (unb, that 
£i+i is T- functional because Zi is T- functional and £i+i |= rj from £j |= rj (using 
that 1-ip-^’l'^ (u) = l-tp'*'’!'^ (ic) and 1-tp^‘l'^(it) = (a)). Thus, £^+1 satisfies 


property (|^. 

It remains to show property Q. Using (*), (**) and Lemma 10 we get that 
ranku{Zi+i, Z') < ranku{Zi, Z') and rank^{Zi+i, Z') < rankz(Zi, Z') for z G { 11 , 11 ;, a}. 
Moreover, rankz(Zi +i,£') = rankz(Zi, Z') for z G M \ {u,v,w,a}. Thus, we have 
rank(Zi, Z') > rank(Zi+i, Z'). 


Case 2: A ^ is not a T-message type. 

We note that the edge (it;,ii) does not have a T-message type because £,; is chro¬ 
matic, V ^w, l-tp^‘l^(ii) = 1-ip^’l^ (ic) and the edge {u,v) has a T-message type. 

We define £,+1 as follows: The unary relations of £i+i are defined such that we 
have l-tp(it)'^’+i = l-tp(it)~’ for all elements it G M. We obtain the binary relations 
of Zi+i by swapping edges in £u 2-tp'*'’+il^ (u, ic) = A, and 2-tp^‘+il® (m, i;) = 
2-tp^‘l^(it,iii). These 2-types are well-defined because of l-tp~‘l'^ (i;) = 1-tp^'l® (ic). 
All other 2-types in Zi+i\£ are the same as in Zi\£. This completes the definition 
of £i+i. 

We now argue that £^+1 satisfies properties Q-Q. As in the previous case, one 
can argue that Zi+i satisfies Q and ([^, Zi+i ^ Xj •^i+i |= 9 and £i+i ^ e. Because 
(v,u) and (w,u) do not have T-message types we get £i+i ^ Cimb from Zi |= C,unb 
and that £^+1 is T-functional because Zi is T-functional. We get £^+1 |= rj from 
Zi 1= r], Z' (= T] and l-tp^’(iii) = l-tp~ (w). 

It remains to show property Q. From Lemma [lo| we get 2-ip’^’l^ (ii, ic) ^ A 
and 2-tp^ l'^(ii,i;) ^ A. Again applying Lemma |l0f we get ranfc„(£i+i,£') < 
ranku{Zi,Z'). Because 2-tp^*l'^(i>,ii) and 2-tp^‘l^(ic,w) are not T-message types, 
we get ranky(Zi+i, Z') = rankv(Zi, Z') and rankw(Zi+i, Z') = rankyj(Zi,Z'). More¬ 
over, we have that ronfc^(£i+i, £') = rankz{Zi,Z') for all z £ M \ {m,i;,iii}. Thus, 
rank(Zi,Z') > ranfc(£i-|_i,£'). 

□ 


5. From Bounded Tree-width to Binary Trees 

This section is devoted to a discussion of the proof of Lemma First we 
need some background from the literature. A translation scheme for C 2 over 
Cl is a tuple t = (^, i/;c ^ C G C 2 ) of MSO(Ci)-formulas such that (j) has exactly 
one free hrst order variable and the number of free first order variables in each 
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ipc is arity{C). The formulas (j) and ipci C G C 2 , do not have any free sec¬ 
ond order variablesj^ The quantifier rank qr(t) of t is the maximum of the 
quantifier ranks of (j) and the ipc- t is quantifier-free if qr(t) = 0. The in¬ 
duced transduction t* is a partial function from Ci-structures to C 2 -structures 
which assigns a C 2 -structure t*(2l) to a Ci-structure 21 as follows. The universe 
of t*(2l) is = {a G ^ ^ \= The interpretation of C G C 2 in t*(2t) is 

(jt ( 21 ) _ 1^ g j^^anty{C) , ^ '0(;;(a)|. Due to the convention that structures do 

not have an empty universe, t*(2l) is defined iff 21 ^ 3x(j){x). 

Lemma 12 (Fundamental property of translation schemes). Let t be a translation 
seheme for C 2 overCi. There is a computable function from MSO{C 2 )-sentences 
to MSO{Cl)-sentences such that for every Ci-structure 21 for which t*(2l) is defined 
and for every MSO(C 2 )-sentence 0, 2t ^ */ only if t*(2t) \= 9 . t** is called 

the induced translation. 


For an MSO(C 2 )-sentence C, t^* substitutes the relation symbols C G C 2 in C with 
the formulas tfci requires that eac h of the free variables satisfies (j>, and relativizes 
the quantification to (j). Appendix 8.9 gives an inductive definition of t** following 
Definition 3.2 of [T^ . 

From the definition of the induced translation we have: 


Lemma 13 (Quantifier-free translation schemes and C^). Lett be a quantifier-free 
translation scheme forC 2 overCi. The induced translation t** maps {C 2 )-formulas 
to {Cl)-formulas. 

It is well-known that the class of graphs of tree-width at most k is the image of 
an induced transduction on a class of labeled trees [T] . For the proof of Lemma 
we need an analogous translation scheme for {Cbnd U Cu„(, )-structures whose 
reducts ^\ci,nd have tree-width at most k. In order that a and /3 be mapped to 
an MSO(I?f,„d)-sentence and a (C„„f, )-sentence respectively, we need that the 
translation scheme satisfy some additional properties. 


Lemma 14. Let Cbnd ond Cunb be vocabularies such that Cbnd H Cunb only contains 
unary relation symbols. There exist the following effectively computable objects: 
(1) a vocabulary Tbnd consisting of the binary relation symbol s and unary relation 
symbols only, (2) a translation scheme tr = {(j), ifc '■ C G Cbnd U Cunb) for Cbnd^Cunb 
over VbndCt Cunb, and (3) an MSO{'Dbnd)-sentence dom, such that: 

(a) 4> is quantifier-free overVbnd, 

(b) For every relation symbol C G Cunb, f’c is quantifier-free. 

(c) For every relation symbol C G Cbnd, ifc is an MSO{'Dbnd)-formula. 

(d) Let OF be the class of {Vbnd U Cunb)-structures in which s is interpreted as a 
binary tree and which satisfy dom. The image of OF under tr* is exactly the 
class of {Cbnd U Cunb)-structures iTfl such that S!ll|ci,„d has tree-width at most k. 


The proof of Lemma 14 is technically similar to the discussion in [7] . We include 
the proof in the Appendix |8.2| for completeness. 

We are now reacW to prove Lemma By Lemma [l4||d]) and Lemma 12 


ment (i) in Lemma|^holds iff there is a {Vbnd UC„„;,)-structure 91 such that s 


state- 
,01 


IS a 


®A11 translation schemes in this paper are scalar (i.e. non-vectorized). In the notation of [2, 
a translation scheme is a parameterless non-copying MSO-definition scheme with precondition 
formula {x ^ x). 
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binary tree and 91 |= dom A tr** (a) A tr**(/3). Let a’ = dom A tr* (a) and P' = tr**(/3). 

i ^ and the definition of tr^, a' S MSO(I?(,„d). Let tr|c„„i, be the 


14 


By Lemma 

translation scheme for Cunb over 'Df,nd U Cunb which agrees with tr on all formulas, 
i.e. tr|c„„t = {(j),ipc ■ C G Cunb)- The image of on a structure ^)Jl is the Cunb- 

reduct of the image of tr* on SOI . Sin ce P G C^iCunb), tr|c„„/(/3) is well-defined and 

b), is a quantifier-free translation scheme, 


14 


= P'- By Lemma_ 

implying that P' G C^{'Dbnd U Cunb) by Lemma 1 13 


6 . From MSO to on Binary Trees 


The purpose of this section is to show that, on structures consisting only of a 
binary tree and additional unary relations, every MSO-sentence can be rewritten 
to a C^-sentence which is equi-satisfiable and whose length is linear in the length 
of the input MSO-sentence. We start by introducing some tools for the literature. 

Theorem 4 (Hintikka sentences). Let C be a vocabulary. For every g S N there is 
a finite set Jtfjr.yVc,q of MSO{C) of quantifier rank q such that: 

(1) every e G Jify.A"c,q has a model; 

(2) the conjunction of any two distinct sentences ei,e 2 G JiC.^JGc,q is not sat- 
isfiable; 

(3) every MSO [C)-sentence a of quantifier rank at most q is equivalent to ex¬ 
actly one finite disjunction of sentences J^J^-A'c.qi 

(4) every C-structure 21 satisfies exactly one sentence hinc,q(2l) of M’.yrc^q. 
We may omitC or q from the subscript when they are clear from the context. 

For a class of C-structures JfC an n-ary operation Op over C-structures is called 
smooth over JF, if for all 2li,... ,2t„ G JF, hinc,g(Op(2li,... ,2l„)) depends only 
on hinc,q(2li): i G [n] and this dependence is computabl^ We omit “over JF” 
when JF consists of all C-structures. 

Theorem 5 (Smoothness). (1) The disjoint union is smooth. 

(2) For every quantifier-free translation scheme t, the operation t* is smooth. 

(3) Let Ti 'o T 2 denote the operation which augments the disjoint union o/Ti 
and T .2 by adding an edge from the root of tree Ti to the root of tree % 2 - 
This operation is smooth over labeled trees|j 

For an in-depth introduction to Hintikka sentences and smoothness and refer¬ 
ences to proofs see [m Chapter 3, Theorem 3.3.2] and [12] respectively. 

We are now ready for the main lemma of this section. 

Let rt(x) be the sentence Fy ^s{y, x). This sentence defines the root of the binary 
tree s. 

Lemma 15. Let q G N and let C be a vocabulary which consists only of the binary 
relation symbol s and (possibly) additional unary relation symbols. Let Ce : e G 
.jFj:.A"c,q be new unary relation symbols. Let hin(C, q) be the vocabulary which 
extends C with {C^ : e G There is a computable C‘^{hhi{C^q))-sentence 

0]l“ such that if To is a C-structure such that is a binary tree: 


^ Smooth operations here are called effectively smooth in m- 

^We use the smoothness of the disjoint union and quantifier-free transductions. The transduc¬ 
tion adds the edge between the roots. Technically, in order for the transduction to be quantifier- 
free, we need that our trees have a special unary relation symbol root with the natural interpre¬ 
tation as the root of the tree. 










MONADIC SECOND ORDER FINITE SATISFIABILITY AND UNBOUNDED TREE-WIDTH 15 


(i) There is an expansion Ti of%Q to hin(C,g) with Ti ^ ©c™- 
(ii) For every expansion o/Tq to hin(C, q) and for every u) G MSO(C) such that 
1 = and qr(a;) = q, the following holds: there exists a -sentence Whin 
such that To \= Lu iff \= Whin- The -sentence Whin is 

Wx I rt(a;) —>■ \J C^{x) 


The sentence is defined so that for every Tq there is a unique expansion Ti 


such that Ti |= 0^™. For every u in the universe Ti of Ti, we will have u G iff 


the subtree T„ of To whose root is u satisfies T„ \= e. Using the smoothness of o, 
whether an element of Ti belongs to depends only on its children. This can be 
axiomatized in . Lemma|^follows from Lemma If 
and Whin = 7- 

Appendix |8.3| spells out the proof of Lemma 


with q = qr(a), V = hin(C, q), 


7. MSO WITH CARDINALITY CONSTRAINTS 

MSQcard jgj^Q^gg ^]^g gxtension of MSO with atomic formulas called cardinality 
constraints X]i=i 1^*1 < Y^i=i where the Xi and Yi are MSO variables, and 
|X| denotes the cardinality of X. Let WSIS (WS1S'^“‘^) be the weak monadic 
second order theory (with cardinality constraints) of the structure (N,+1, <). Let 
C MSO'^“'^ be the set of sentences p such that (1) p is of the form p = 
3Xi ■ ■ ■ BXmUJ, and ( 2 ) only the Xi,..., Xm participate in cardinality constraints. 

Theorem 6 . Given a sentence p G it is decidable (A) whether (N, +1, <) ^ 

p, and (B) whether p is satisfiable by a finite structure of bounded tree-width. 

This theorem follows from Theorem The main observation needed for (B) is 
that cardinality constraints can be expressed in terms of injective functions, which 
are axiomatizable in C^. (A) is reducible to (B). The main observations for (A) 
are: 

( 1 ) that Xi,..., Aim are contained in the substructure 2li of (N, +1, <) gener¬ 
ated by {0,..., (.} for some £ G N, 

(2) that substructure 2 I 2 of (N, -1-1, <) generated by N —{0, ...,£} is isomorphic 
to (N, -|-1, <), and therefore 2 I 2 and (N, -1-1, <) have the same weak monadic 
second order theory, 

(3) that the weak monadic second order theory of (N, -1-1, <) is decidable, 

(4) and that (N, -|-1, <) is a transduction t of 2li U 212 . 

It remains to use that U is smooth, cf. Appendix |8.1[ 
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8. Appendix 


8.1. Proof of Theorem]^ We start with (B). 

Let p be a sentence, i.e. the outermost block of quantifiers in p is 

existential and only variables from the outermost blocks may appear in cardinality 
constraints. For simplicity we consider p = BX 1 BX 2 U! with only two quantifiers in 
the outermost block. W.l.o.g. the only cardinality constraint in w is \Xi\ < \X 2 \. 
By a slight abuse of notation, we sometimes treat Xi and X 2 as unary relation 
symbols. Let Cbnd extend the vocabulary of p with new unary relation symbols 
Ai,A 2 ,kFimg,Wdom- Let Cunb extend Cbnd with a new binary relation symbol B. 
Finite satisfiability of w by a structure ^)Jl such that tw(91t) < k can be reduced 
to finite satisfiability of a sentence a A (3, a G MSO(C{,„d) and (3 G C^{Cunb), by a 
structure 2li such that tw(2li|cj„j) < k. Let f3 be the C^-sentence 

P = (inji2 V inj2i) A dom A img 


where 

• inj 22 expresses that B is an injective function from Xi to X2, 

• inj 2 i espresses that B is an injective function from X2 to Xi, 

• dom expresses that the domain of B is IFdom) 

• and img expresses that the image of B is Wimg. 

For every -structure 2li, \Xf^ \ < \Xf^ \ iff = Xf^ and ^ 0. 

Let a be obtained from w by substituting every \Xi \ < IA 2 I by 


Vx {Xi{x) GA Wdom(x)) A 3x (^lFii„g(a;) A X2{x)) 

For any Cb„d-structure Sip with tw(2lo) < fcj Sip ^ a; iff there is an expansion 
2li of 2lo such that 2li |= a A (3. The treatment of other cardinality constraints 
J2i=i \^i\ < X]i=i \^i\ similar; it is helpful to assume w.l.o.g. that the Xi and 
the Yi are disjoint. 

Now we turn to (A). The unary function -|-1 is the successor relation of N and 
interprets the binary relation symbol sue. The binary relation < is the natural order 
relation on N. In the proof of (A) we will use the theory of Hintikka sentences as 
presented in Section with one caveat, namely that instead of restricting to finite 
structures, we allow arbitrary structures. Theorems and hold for arbitrary 
structures. Theorems guarantees the existence of a set analogously to 

■^^dP.A'c.q for arbitrary structures. For every C-structure 21, Theorem guarantees 
the existence of a sentence hin"g(2l) of M" analogously to hinc_q(2l). For the 

rest of the section, we omit the superscript arb to simplify notation. 

Consider p = 3Ai3A2a; for the vocabulary Cn of (N, 3-1, <). Let a G MSO(C;,„d) 
and f3 G C^iCunb) be as discussed in the proof of (B) above. The following are 
equivalent: 

1. (N, 3-1, <) 1= p 

2. There are finite unary relations Ui and U 2 such that (N, 3-1, <, l/i, U 2 ) \= oj. Ui 
and U 2 interpret Xi and A 2 respectively. 

3. There are finite unary relations Ui and U 2 and an expansion 21 of (N, 3-1, <, 17i, U 2 ) 
such that % \= a A [3. 21 expands (N, 3-1, <, t/i, C/ 2 ) with interpretations of the 
symbols in Vunb = {B,Wa om 1 Wimg}- 

4. p' = 3 Ai3 A2 (q; a (3) is satisfiable by an expansion of (N, 3-1, <) with interpre¬ 
tations for the symbols of 
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We have 1. iff 2. and 3. iff by the semantics of 3Xi3X2 in weak monadic 
second order logic. We have 2. iff 3. similarly to the discussion of a and /3 in the 
proof of (A) above. The rest of the proof is devoted to proving that 4. is decidable. 

Observe that by the definition of /3, and in weak monadic second order Xi and 
X 2 are quantified to be finite sets, B is axiomatized to be a function with finite 
domain, and Wdom and kkimg are finite. Hence models 21 of p' can be decomposed 
into a finite part containing and an infinite part isomorphic to an expansion of 
(N, +1, <) in which the symbols of Vunb as are interpreted as empty sets. We will 
use a similar decomposition, but first we want to move from the structure (N, +1, <) 
and its expansions to (N, +1) and its expansions. There is a translation scheme t< 
such that for every structure 21 = (N,+1, ITj^g), t^((2l)) = (21, <), 

where (21, <) is the expansion of 21 with <. This is true since < is MSO definable 
from +1. We have that p' is satisfied by an expansion of (N,+1,<) iff t^^{p') is 
satisfied by the same expansion of (N, +1). 

Now we turn to the decomposition of models of tj, {p') into a finite part containing 
and an infinite part isomorphic to a I?tm;,-expansion of (N, +1). Let I ?2 2 
be the vocabulary of t^^{p'). For every I? 2 -expansion fp of (N, +1) and every n S 
N, let and be the substructures of ip generated by [n] and N\[n — 1] 

respectively. There is a translation scheme u such that u*(fPi „ U fPn+i 00) = *P if 
H'P C [n] X [n]. u existentially quantifies the set [n] (which is the only non-empty 
finite set closed under sue and its inverse), 1 and n (as the first and last elements of 
[n]) and n-l -1 (as the only element without a suc-predecessor except for 1 ) and adds 
the edge (n, n-|-l) to sue. We have ip |= t<(/ 9 ') iff iPi^„Ufp„+i_oo ^ ui*(t5, (p')). Note 
that the vocabulary of ui*(t5,(p')) is I? 2 - Let q be the quantifier rank of ul*(t5, (/))■ 
The Hintikka sentence hin(fp„+i_oo) of quantifier rank q of fPn+ 1,00 is uniquely 
defined since iPn+i^oo is isomorphic to the expansion of (N, - 1 - 1 ) with empty sets. 
Moreover, hin(ip„+i^oo) is computable using that the theory of (N,-|-1) is decid¬ 
able. Hence, by the smoothness of the disjoint union, for every Hintikka sentence 
e S M ’there is a computable set C such that hin(*Pi_„ U 

'Pn+1,00) = e iff hin(Pi^„) e E^. Then‘Pi,^ UfPn+i^oo 1 = iff'Pi.™ satisfies 

the sentence \/(e e') where the \/(e e') ranges over pairs (e, e') such that 

(1) e S 

( 2 ) e h u“(t“<(p')), and 

(3) e' eE,. 

Hence, p' is satisfiable by an expansion of (N, -1-1, <) iff \/(f e') ^ satisfiable by a 
finite structure in which sue is interpreted as a successor relation (i.e., as a simple 
directed path on the whole universe). Let suc-rel be the weak MSO-sentence such 
that the interpretation of sue is a successor relation. By Theorem |^B), it is decid¬ 
able whether, d A suc-rel is finitely satisfiable using that the class of simple 

directed paths annotated with unary relations has tree-width 1 . 

Remark 1. While we assumed for simplieity in (B) that Xi and X 2 range over 
subsets of the universe, it is not hard to extend the proof to the case that Xi and X 2 
are guarded second order variables which range over subsets of any relation in the 
structure. This is true since we can use the translation scheme tr from Lemma [7^ 
to obtain the structures of tree-width at most k as the image of tr* of labeled trees; 
Xi and X 2 then translate naturally to monadic second order variables. 
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8.2. Proof of Lemma |14[ This appendix is devoted to the proof of Lemma [T4| 
Appendix |8.2| introduces a tree encoding of structures of tree-width at most k 
based on aligned k-trees. Appendix |8.2| introduces a translation scheme such that 
the image of the induced transduction on an MSO-definable class of labeled binary 
trees is the class of structures of tree-width at most k. This is done based on 
translation schemes introduced in Appendix |8.2| Finally, the proof of Lemma [T^ is 
given in Subsection |8.2| 

Note that the notion of aligned k-trees is quite similar to the notion of oriented 
k-trees defined in Section In fact, we could have replaced aligned A:-trees with 
oriented fc-trees here, or replaced oriented fc-trees with aligned k-trees throughout 
the paper. However, it simplifies the proofs to have two notions. 


Aligned k-tree encodings and (decorated) aligned k-trees. An aligned Ic-tree encod¬ 
ing T is a binary tree whose vertices are labeled by certain unary relations. All 
structures of bounded tree-width can be obtained by applying transductions to 
aligned fc-tree encodings with additional vertex annotations in three steps: 

Step A Aligned k-tree encoding mte^ret Decorated aligned fc-tree 

Step B Decorated aligned k-tree u^decorate Aligned /c-tree 

structurize(;^ 

Step C Aligned /c-tree —> C-structure oi tw < /c 


interpret, undecorate and structurizec (shorthand i, u and sc) are unary 
operations on structures given in terms of translation schemes (see Section 8.2). 
Two key properties of our encoding: 


(a) Using that the number of edges in a fc-tree is at most k ■ (number of vertices), 
our (decorated) aligned fc-trees have functions Ri,... ,Rk rather than an edge 
relation. 

(b) The elements of the universe of a structure appear as elements in the aligned 
fc-tree, in the decorated aligned /c-tree, and in the aligned fc-tree encoding. 
Aligned fc-tree encodings and decorated aligned fc-trees have additional auxil¬ 
iary vertices which are eliminated by undecorate. 


Let V be the vocabulary containing the binary relation symbol s, and the unary 
relation symbols root, Labeli,..., LabeU, Labelbiank- 

An aligned fc-tree encoding is a V-structure T with universe T such that 
(i) {T,s‘^) is a directed tree (i.e., an acyclic directed graph in which all vertices 
have in-degree 1 except exactly one.), (ii) every vertex in (T, s‘^) has out-degree at 
most 2, (hi) Labelf,..., LabelJ, Labelbiank form a partition of T, (iv) root‘d denotes 
the unary relation containing only the unique vertex with in-degree 0 in (T, s”^), 
and (v) all the children of vertices in T\Labelbiank belong to Labelbiank- Fig. [^a) 
shows an aligned 3-tree encoding. The edges represent s. The small black circles 
represent Labelbiank, the larger red circles represent Labeli, the squares represent 
LabeU, and the diamonds represent Labels. 


Lemma 16. There is a -sentence enc such that for every V-structure iXft in which 
s^ is a binary tree, tfft ^ enc iff is an aligned k-tree encoding. 

Proof. Being an aligned fc-tree encoding is expressible in for structures in which 
s is interpreted as a binary tree. Below fx refers to requirement X in the definition 
of an aligned fc-tree encoding, (i and are already taken care of since s is a binary 
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Figure 3. (a) An aligned 3-tree encoding, (b) The decorated 

aligned 3-tree obtained from (a) by applying i*. i?i, and 
are denoted by the dashed arrows, the solid arrows and the (sin¬ 
gle) thick arrow, respectively, (c) The aligned 3-tree obtained 
from (b) by applying u*. (d) A aligned 3-tree 21 expanding (c) 
with unary relations annotating its elements. 21 encodes a directed 
graph G = {V,E). The unary relations are U® and for each 

j G {1,2,3}. They encode, for each element m of 21, whether the 
edge {m,Rj{m)) respectively {Rj{m),m) belongs to E. (e) The 
directed graph obtained from 21 in (d) by applying s*. Gaif(G') has 
tree-width 2. 


tree, enc is given as follows: 


enc 

^iii 


Civ 


^iii ^ ^iv 

Vx I Labelbiank(a^) V \J Labeb(a:) ] A 




i=l 


Vx I \J Labeb(a;) o ^Labelbiank(a;) ) A 


'^i=l 


\/x I y/y (^Labeb(x) V ^Labelj(x)) 
yAi 

Vx (root(x) O yy^s{y, x)) 

VxVy (s(x, y) A ^Labelbiank(a:) Labelbiank)?/)) 


□ 


A decorated aligned A:-tree 21 is an expansion of an aligned A:-tree encoding T 
to7?.U7^. For every} G [fc], i?® contains all pairs (?;, m) G (r\(LabelJuLabelbiank))^ 
LabelJ such that there is a directed path P from m to x in (T, s"^) which does not 
intersect with LabelJ except on m, i.e. LabelJ H {P\{u}) = 0; moreover, for every 
u, if i?® does not contain any pair (it, x), then ii® contains additionally the self 
loop (u,it). Observe that the relation i?® is a total function. Fig. |^b) shows the 
decorated aligned 3-tree obtained from (a). The new edges represent Ri, R 2 , and 
R 3 : dashed edges represent i?i, solid edges represent i? 2 , and the single thick edge 
(from a square to a diamond) represents R 3 . 
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A aligned fc-tre^ is the 7?.-reduct of the substructure of a decorated aligned 
fc-tree SIq generated by Labelf® U • • • U Label®". Fig. ^c) shows the aligned 3-tree 
obtained from (b). The vertices of (c) are not labeled by any unary relation. 

Lemma 17. 

(1) For every aligned k-tree 21, Gaif(2t) is a partial {k — l)-tree. 

(2) For every partial {k — l)-tree G there is an aligned k-tree 2t such that G is a 
subgraph of Gaif (21). 

Proof. For the proof of Lemma we define an aligned fc-tree encoding T to be 
perfect if (1) there are wi,pi, W 2 , ■ ■ ■ ,pk-i,Wk G T such that wi,pi, W 2 , ■ ■ ■ ,pk-i,Wk 
is a directed path, (2) root‘d = {wi}, (3) Wi S Labelf" of all i € [fc], (4) pi G 
Labelbiank for all i G [/c — 1], and (5) the out-degree of each of vertex in the path 
is exactly 1. If T is a perfect aligned fc-tree encoding, then u*(i*(‘L)) is a perfect 
aligned fc-tree. We have: 

(o) For every perfect aligned fc-tree 21, Gaif(2t) is a (fc — l)-tree. 

(o) is proved by induction on the number of vertices in a perfect aligned fc-tree 
encoding T such that u*(i*(T)) = 21. Lemma [T7|(l) follows from (o) by building a 
perfect aligned fc-tree such that 21 is its substructure. 

First we prove: 

(o) For every perfect aligned fc-tree 21, Gaif(2t) is a (fc — l)-tree. 

Let T be a perfect aligned fc-tree encoding such that u*(i*(T)) = 2t. We prove 
the claim by induction on the number of vertices in T. 

• In the base case, T consists only of the vertices wi,pi,... ,pk-i,Wk, and 
u*(F(T)) is a fc-clique, which is a (fc — l)-tree. 

• Let T be a perfect aligned (fc — l)-tree encoding. Let Tq be obtained from 
T by removing a leaf v. By the induction hypothesis, Gaif(i*(To)) is a 
(fc — l)-tree. If ?; G Labely,^^]^, then u*(i*(To)) = u*(i*(T)). Otherwise, let 
V G LabelJ. We denote by B = {m : i G [fc]\{j}} the set of elements such 
that Ui G Labeb and there is a path Pi from Ui to in T satisfying the 
following property: Pi does not visit any vertex in Labeb except Ui. Note 
that for all i G [fc]\{i} Ui exists, and hence |i3| = fc — I, using that Tq is 
perfect. By the choice of B and since z; is a leaf, B is the set of neighbors 
of V in Gaif(u*(i*(T))). It remains to show that i? is a (fc — I)-clique to 
get that Gaif(u*(F(T))) is a (fc — l)-tree. Let ii,i 2 & [^]\{j} with ii 12 . 
Since (T, is a directed tree, either Pi^ C Pi^ or Pi^ C W.l.o.g. let 

C Pj^, so Pi^ is a path from Ui^ to v which visits Ui^. Let P 21 C Pj^ 
be the path from Ui^ to ztjj. By the choice of B, there is no vertex in 
P 21 which belongs to Labeb^ except Ui^. Hence, {ui^,Ui^) G P“ i.e. 

there is an edge between and Ui^ in Gaif(u*(i*(T))). We get that B is 
a (fc — l)-clique and Gaif(u*(i*(T))) is a (fc — l)-tree. 

Now we are ready to prove Lemma [Tt] )!) and Lemma 
(1) Let T be an aligned fc-tree encoding such that u*(i*(T)) = 2t. Let To be 
obtained from the directed path with Wi € Label^° for 

^ A small but important note is that the definition of aligned k-tvees used throughout this 
appendix deviates slightly from the one given in Section]^ and used elsewhere in the paper. 
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each i G [k], pi G for each i G [A — 1], and root‘d = {wi} by attaching 

T as a child of Wk- By construction, Tq is perfect. By (o) we have that 
Gaif(u*(i*(To))) is a (fc — l)-tree. Deleting the vertices wi,pi,... ,pk-i,Wk 
and the edges incident to them from Gaif(u*(i*(To))) we get Gaif(u*(i*(T))), 
implying that Gaif(u*(i*(T))) is a partial fc-tree. 

(2) We prove this by induction on the construction of (fc — l)-trees. 

(a) Let G be a (fc — l)-clique with vertices Wi,... ,Wk-i- Let ^base be the di¬ 
rected path on the vertices wi,pi,... ,pk- 2 ,Wk-i such that Wi G Labelf'’”'" 
for all i G [fc — 1] and pi G Labely“,(J for all i G [fc — 2], We have that 
Gaif(u*(i*(T 6 ,,e))) = G. 

(b) Let Go be a (fc — l)-tree and let To be an aligned fc-tree encoding such 
that Gaif(u*(i*(To))) = Go- Let G = {ui,... ,Uk-i} be a (fc — l)-clique 
in Go- Let G be the graph obtained from Go by adding a new vertex v 
as well as edges between v and each of the vertices in G. Since G is a 
(fc — l)-clique in Go, for every distinct ii,i 2 G [fc — 1] there is a directed 
path in To between from Ui^ to Ui^ or vice versa. As a consequence, there 
is a path P in To from the root to some Ui such that ui ,..., Uk-i all occur 
on P. For every t G [fc — 1] and i G [fc] such that Ut G Label^“, Ut is the last 
vertex in P to belong to Labelf®. Since |G| = fc — 1, there is j G [fc] such 
that LabelJ® n G = 0. Since G is a clique in Gaif(u*(i*(To))), it must be 
the case that |Labelf° n G| = 1 for alH S [fc]\{j}- If ue has two children, 
let T be a subtree of To whose root is a child of U(. Let T be obtained 
from To by attaching a new child vertex Ubiank G Labeloiank lo ui, moving 
T to be a child of Ubiank, and adding u as a child of Ubiank- If ue is a leaf 
or has one child in To, let T be obtained from To by attaching u as a child 
of U£. Setting v to belong to Label^', we get Gaif(u*(i*(T))) = G. 

□ 

The translation schemes interpret, undecorate and structurizec- There 
are translation schemes interpret (shorthand i) for 7?. U V over V and undecorate 
(shorthand u) for TZ over TZUV which take an aligned fc-tree encoding to its dec¬ 
orated aligned fc-tree respectively a decorated aligned fc-tree to its aligned fc-tree. 
The induced transductions i* and u* are surjective with respect to the classes of 
decorated aligned fc-trees resp. aligned fc-trees. 

For every vocabulary C there is a translation scheme structurizec (shorthand 
Sc) which takes an aligned fc-tree whose elements are annotated with unary relations 
to a C-structure. The induced transduction sc is surjective with respect to the class 
of C-structures of tree-width less than fc. The binary relations of such structures 
are encoded inside aligned fc-trees by unary relations on the sources of Rj edges, 
using that the Rj are functions. The vocabulary consisting of these new unary 
relation symbols as well as the unary relation symbols of C is denoted by JVq ■ Sc is 
a translation scheme for C over TZUAfc- Fig-[^d) shows an aligned fc-tree annotated 
with the unary relations of JVc for C = (E ), where A is a binary relation symbol. 
Fig-i e) shows the directed graph obtained from (d) by applying sc. 

Lemma 18. 

A. : For every decorated aligned k-tree 21, i*(2t|v) = 21. 

B. : For every aligned k-tree 21 obtained from a decorated aligned k-tree 2lo, 
u*( 2 lo)= 2 l. 
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C.: There is a vocabulary J\fc consisting of unary relation symbols only, 
such that for every C-structure 2 JI, has tree-width at most k iff there is 
an aligned k-tree ‘Qiori and an expansion 21 of 2lori to JVc U TZ such that 
s*cm = m. 

Lemma follows from Lemma |18[ Lemma |16| and Lemma with T>bnd = 
U V, a' = encAi«(ul‘(s^(a))), and /3' = i#(u«(s“.(/3))). 


Proof of Lemma |18K A). i = ■ C & TV) is given as follows: 

(1) The universe A of 21 is T, i.e. ‘p{x) = [x ~ x). 

(2) The relations from V are copied without change, i.e. ftsix, y) = s{x,y), and 
f’Laheiix) = Label(a;) for each Label G {Labeli,..., Labels, Labelbiank}- 

(3) 'il)R-{x,y) = g}{x) A ^Labelj(a;) A Labelj(?/) A Vz (z 76 y —>• ^Label 7 (z)) A 
(dpath(a;, 2 /)) where dpath(a;, 2 /) says that there is a subset P of s which is 
a directed simple path from x to y: 


dpath(a;, y) = 3P C s \/z {in<i{P,x)) Aino(P,x) A 

Mz (m<i(P, y)) A outo{P, y) A reach(P, x, y) 

reach(P,a;,y) = 'iXCvi {X(x) A (y)) ^ 


ino{zi,P) 

outo{zi,P) 

in<i{zi,P) 

out<i{zi,P) 


(3zi3z2 iX{zi) A -^X(z2) A P(zi,Z2))) 


VZ 2 (^P(z 2 ,Zi)) 
VZ2 (^P(zi,Z2)) 
3^1z2 {P{z2,z,)) 
3^^Z2 {Pizi,Z 2 )) 


reach(P, x, y) says that y is reachable from x via P edges. 


Proof of Lemma |18K B). Let u = (tp, ,..., be as follows: g^[x) = 
^Labelbiank(a;) and for every iG [k], 'fR,{x,y) = R^{x,y). 

Proof of Lemma |18KC ). We prove the following lemma which spells out the two 
directions of Lemma[l^C): 

Lemma 19. For every vocabulary C, there is a set Afc of unary relation symbols 
and a translation scheme structurizec (shorthand sc ) forC over AfcGJTZ such that: 

(1) If VJl is a C structure whose Gaifman graph is a partial (k — l)-tree, then there 
is an expansion 21 of an aligned k-tree to Nc U TZ for which s(, (21) = DJt. 

(2) //2l is a {NciTTZ)-structure and^red = ‘^\ti is an aligned k-tree, then Gaif(s^(2t)) 
is a partial {k — l)-tree. 


For every vocabulary C, let Afc be the vocabulary extending the set un(C) of 
unary relation symbols in C by fresh unary relation symbols Ps^seif, Ubj, and 
Ub. invj for every binary relation symbol B G C and j G [fc]. Recall Vc = V U A/c 
and TZc = Vc U P U C. 

Since the Ri are functions, we use a fixed number of unary relation symbols 
to encode, for every universe element mi such that (mi,m 2 ) G Rf^, what other 
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relations of 971 do {mi,m2) and {m2,mi) belong to. The unary relation symbols 
are of the form Ubj and where j € [fc] and i? is a binary relation symbol. 

We use that tuples {mi, m2) which occur in any relation of 97t, occur also in 
or in {R™) Additional unary relation symbols UB,seii encode self loops {m,m) 
in 

Let Sc = {(fi, ipc ■ C G C) is the translation scheme given as follows. 

• (p{x) = {x ~ x). 

• For every unary relation symbol U G C, 4’u{x) = U{x). 

• For every binary relation symbol B G C, consists of all pairs {v,u) 

such that (a) {v, u) G Rj and v G or (b) {u, v) G Rj and u G 

This is given by 

tpB{x,y) = -^{x y) A \J {{Rj{x,y) AUb,j{x))V {Rj{y,x) AUB,inv,j{y))) 

76 [fc] 

V{x ^y) A C/b, seif(a^) 


Let 971 be a C-structure with universe M. Let 2to be an aligned k-tree 
such that Gaif(97t) = Gaif(2lo) guaranteed by Lemma 17 Let 2li be the 
expansion of 2to such that, for every B G C and j G [fc], 


^Bj = {^1 I (^1,^2) e Rf ns™} 

= {mi\{m2,mi)G{Rf-)~"fiB^] 

^B.seif = {m\{m,m) GB'^} 


Self loops are encoded by the relations of the form C/g . Consider a,h G 
M which are distinct. We divide into cases. 

— Assume {a, h) is not an edge of Gaif(97l). By Gaif(97t) = Gaif(2li), nei¬ 
ther {a,b) nor {h,a) belong to any Hence, 2ti ^ '4’B{a,b), 

i.e. (a, b) ^ ^ fQj. g^jjy binary relation symbol B G C. 

— Assume {a,b) is an edge of Gaif(97t). By Gaif(97t) = Gaif(2lo), either 
(a, b) or {b, a) belong to some . We have: (a, b) G B^ iff 

exists j € [k] such that [{a,b) G Rf^ and a G or [(&, a) G RJ^ 

and b G iff 2li |= V'B(a, b) iff (a, b) G 

In both cases {a,b) G B™ iff {a,b) G We get that 971 = sj(2li) 

and (1) follows. 

Now we turn to (2). By Lemma[T^l), the Gaifman graph of 21 is a partial (fc—1)- 
tree. By definition of ipB, for every binary B G C, c lJjG[fe] 

implying that Gaif(s^(2l)) is a partial (fc — l)-tree. 


The translation scheme trc: 

from structures of bounded tree-vifidth to labeled trees. Let ic and uc be 

the translation schemes which extend i and u respectively as follows. The unary 
relations C of Nc are additionally defined under ic and uc to be tpc{x) = C{x). 
Hence, ic is a translation scheme for 77. U V U Afc over V U Afc, uc is a translation 
scheme for 77 U Nc over 77 U V U Nc , and sc is a translation scheme for C over 
77 U Nc . 
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Before we define trc, we need to introduce the notion of composition of trans¬ 
lation schemes. Let Cq, Ci and C2 be vocabularies. Let Ci) be a 

translation scheme for Ci over Ci-i. The composition of ti and t 2 , denoted tiot 2 , 
is the translation scheme given by t = ■ C £ Ci) such that 

(1) 0(x) = (ti)*‘((t2)“‘(a; « a:)) 

(2) For every C e un(C), ^c{x) = (ti)“((t2)“(C(s))) 

(3) For every C S bin(C), i/'c = (t^)**((t^)‘*(C'(a;, j/))) 

The translation scheme tc is the composition of ic, uc and sc, i.e. tc = ic o (uc o 
Sc). tc is a translation scheme for C over V U fCc ■ As a consequence of Lemma [I^ 
we have: 


Lemma 20. There is a vocabulary Me consisting of unary relation symbols 
only, such that for every C-structure 971, 971 has tree-width at most k iff 
there is an V U Me -structure 21 such that tf, (21) = 971 and 21 is an aligned 
k-tree encoding. 


Proof of Lemma |l4| , Let Cbnd and Cunt be vocabularies such that CbndC]Cunb only 
contains unary relation symbols. Let Vbnd = We define tr which extends 

tc6„d as follows: For every C G Cunb\Cbnd, let ifcix) = C{x) or 'ii)c{x,y) = C{x,y). 
I.e. tr copies the relations of Cunb\Cbnd from the source structure to the target 
structure without change (except taking the projection to the universe of the target 


structure imposed by the formula (j){x)). Let dom = enc from Lemma 16 

(a) (j) is quantifier-free since tc is obtained from the composition of translation 
schemes in which (f is quantifier-free. 

(b) ipc is quantifier-free for every C G Cunb\Cbnd by the definition in this subsection. 
ipe is quantifier-free for every C G un(Cf,„d) since we have in fact ipc{x) = C{x); 
this is true because it is true for each of ic, uc and sc. 

(c) For every C G Cbnd, ipc is defined in the translation scheme trci,„^. Since trct„j 
is a translation scheme for Cbnd over V ^ Me^^d = T^bnd, f’e S MSO(I7b„d). 

(d) This item follows from Lemma 20 and using that dom = enc defines the class 
of aligned fc-tree encodings on structures 21 in which s® is a binary tree. 


8.3. Proof of Lemma |15[ For a leaf b, hing(Tb) depends only on the unary re¬ 
lations which b satisfies. By Theorem for a vertex b with one child bo (two 
children bo, bi), hinq(Tf,) depends only on the unary relations which b satisfies and 
on hin,(Tf,J (on hinq(Tf,J and hin,(T,,J). 

Let 

0c“ = part A leaves A intsi A ints 2 

where part says that {C^ : e G ^^^e,q\ partition the universe, and leaves, intSi, 
and intS 2 define the for the leaves respectively the internal vertices of Tq with 
one or two children. 

We give part, leaves, intsi and ints 2 below. There are formulas leaf(a:), 
inti(a;), and int 2 (a:), which express that a: is a leaf, has one child, or has two 
children, respectively. Let 



CPix) A 


A 

^ c,, 


hCei(x) V -^Ce^ix)) 


part = Vx 
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For U C un(C), let Qu be a C-structure with universe O of size 1 satisfying that 
= 0 iff [/ & and root^“ = O. Let 


intsi = Vx I inti(a;) —> ((this;^(x) A childc^(x)) —>■ (x)) 1 

\ W,ei.e 2 / 

thiS;^(x) = f\ 

C7Gun(C) 

childejy) = 3ys[x,y)^C^^{y) 

/\u Cl £2 El, £2 S and U C un(C) such that for every structure 

21 G J^oot with hin(2t) = € 2 , hin(D;^ ^ 21) = ei, The notation stands for Y if 
X is false, and for -^Y otherwise. 

We define 


leaves = Vx 


leaf(x) —> /y thisiy(x) —>■ C^^{x) 


U,ei 


/\jj ranges over ei G JY’j^^c,q and U C un(C), where hin(DiY) = ei. 
Finally, intS 2 is defined as follows: 


intS 2 

intdist(x) 


intsame(x) 


childrene^ (x) 


Vx(int 2 (x) —>■ (intdist(x) A intsame)) 

/y ((this;^(x) A childe 2 (x) A childe 3 (x)) —>■ C'ei(x)) 

U,ei ,62,63 

/y ((this;./(x) A childrenj^ (x)) —>■ (x)) 

Z//,6i ,62 

3=^y{s{x,y) AC^^iy)) 


f\u Cl £2 £3 intdist(x) ranges over 61 , 62,63 G and U C un(C) such that 

£2 £ 3 , and for every two structure 21, IB G ^oot with hin(2l) = 62 and hin(lB) = 

63 , hin((D;^ o 21) o 25) = 61 . /\^ in intsaine(x) ranges over 61,62 G J^^^c,q 

and U C un(C) such that for every structure 21 G .Xfroot with hin(2l) = 62 , hin(D;^ 'o 
21 ) = 61 . 

By definition, G C^. To is a C-structure in which s is interpreted as a binary 
tree. Let Ti be the expansion of To such that, for every element b of the universe 
Ti = To of Ti, 6 G where ‘lb is the subtree of To rooted at b. In particular, 

for the root r of Tq, r G have Ti |= 0^“) and hence (i) 

holds. Since Ti is the only expansion of To which satisfies 0^1™, (h) holds. 


8.4. Proof of Lemmaj^ We assume 2-tp(S[lli) = 2-tp(9Jl2). Let ui, xi G M be two 
elements and let A = 2-fp®^^ (mi, xi). Then there are two elements U 2 ,V 2 G M such 
that A = 2-ip^'^{u2,V2) because of 2-tp(9Jti) = 2-tp(9}t2). Thus, IHi ^ x(ui,xi) 
implies 9Jt2 \= x(Y 2 ,V 2 )- Because mi,xi have been chosen arbitrarily, we get that 
21ti \= (j) implies 9 JI 2 \= (j). The other direction is symmetric. 


8.5. Proof of Lemma |6l 

Proof. Let z = 2|Tp + 1. We set colors(T) = {Ai \ l < i < z}, where the Ai are 
fresh unary relation symbols. 
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Let SJt be a -structure with universe M. By Lemma there is a proper 
z-coloring of G^. We expand SEJl to a {Cunb Ucolors(T))-structure £ as follows: for 
all elements u G M we have Af{u) iff u has color i in the proper z-coloring of G^. 
Clearly, £ is a chromatic {Cunb U colors (T))-structure. □ 


8.6. Proof of Lemma We proceed by induction on the number of vertices 
n = \V\. The case n = 1 is trivial. Assume that n > 1 and that the lemma holds 
for n—1. We have J2vev ^ • J2vev deg~^{v) < 2nk. Thus there is a vertex 

V G V with deg{v) < 2k. By the induction hypothesis, the graph G—v has a proper 
2k + 1-coloring. Because of deg{v) < 2k the proper 2k -|- 1-coloring of G—v can be 
extended to a proper 2k + 1-coloring of G. 


L7. Proof of Lemma 


10 


Let V G M he an element with 2-tp’^l^ (m, u) = A. We 
get £ ^ P\{u) because of £ |= C,unb- Because of l-lp^(u) = 1-tp^ (u) we have 
£' ^ Px{u). Because of £' ^ Chnd we have A = 2-tp^ ^^{u,w) for some w G M. 
ranku{£,, 2') = 0 implies that £' \= Ri{u,v). Because of £' ^ 5 we get v = w. In 
the same way one can show that 2-ip'^ {u, v) = X implies 2-ip'^l® (u, v) = A. 


8.8. Proof of Lemmaj^ We show £|ci,„j = ^'\cbnd- hetu,vG M with £ \= B{u,v) 
for some B G bin(C;,„d). Because of £ ^ 0 we have £ \= Ri{u, u) or £ ^ Ri{v, u) for 
some i G [/c]. Because of rank{£,,2') = 0 we have rankl^{2, £,') = 0. By Lemma 


10 


we have 2-tp^^‘^ {u,v) = 2-ip^ Thus, £' ^ B{u,v). In the same way one 

can show that £' \= B{u,v) implies £ ^ B{u,v). Thus, £|ci,„d = ^'Ictnd- 

We show Gaif(£|cj„d) = Gaif(£'): We have that Gaif(£') = Gaif(£'|K) because 
of £' ^ 0 and £' |= e. We have Gaif(£|ci,„j) = Gaif(£|7^) because of £ |= 9 . We 
have Gaif(£'|77,) = Gaif(£|77,) because of rank{£,,£') = 0. Thus, the claim follows. 


8.9. The induced translation t®. We spell out the inductive definition of the 
induced translation. 

Let Co and Ci be vocabularies. Given a translation scheme t =< </>, ipc '■ C gCq > 
for Co over Ci we define the induced translation t^* to be a function from MSO(Co)- 
formulas to Ci-formulas inductively as follows: 

(1) For G G un(Co) or for monadic second order variables G, and for 9 = G{x), 
we put 

ttt(6») = -ipc{x) A (l}{x) 

(2) For G G bin(Co) and 9 = G{x,y), we put 

t“(^) = y) A (l){x) A (j){y) 

(3) For X ^ y,we put 

t'^{9) = X « A(j){x) A (j){y) 

(4) For the Boolean connectives the translation distributes, i.e. 

• a 9 — 9i\/ 92 then 

t»(0) = (t*(0l)Vt«(02)) 

• ii 9 = ^9i then 
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(5) For the existential quantifiers, we relativize to (j>: 

If 0 = 01, we put 

t“(0) = At“‘(0i)) 

If 0 = 317 01, we put 

t#(0) = 3C/(t»(0i)AVyC/(y)^.^(2/)) 

We have somewhat simplified the presentation in [ini Definition 2.3] to fit our 
setting. 

TU Vienna 

E-mail address: {kotek.zulegerSforsyte.at} 


